An improved speech segmentation quality measure: the r-value
Abstract
Phone segmentation in ASR is usually performed indirectly by Viterbi decoding of HMM output. Direct approaches also exist, e.g., blind speech segmentation algorithms. In either case, the performance of automatic speech segmentation algorithms is often measured with automated evaluation algorithms, and the resulting scores are used to optimize the segmentation system. However, the evaluation approaches reported in the literature were found to be lacking. We have also determined that increases in phone boundary detection rates are often due to increased over-segmentation rather than to algorithmic improvements: with current quality measures, a better hit rate can be achieved simply by adding random boundaries. Since established measures were found to be insensitive to this type of random boundary insertion, a new R-value quality measure is introduced that indicates how close a segmentation algorithm's performance is to an ideal point of operation.
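The abstract does not spell out the formula, so the following is only a minimal sketch of how the R-value is commonly stated: the ideal operating point is taken to be a 100% hit rate with 0% over-segmentation, and the score combines the distance to that point with the distance to the line of zero net over-segmentation. The function name `segmentation_scores` and the boundary counts in the usage lines are hypothetical and chosen purely for illustration.

```python
import math


def segmentation_scores(n_hit, n_ref, n_found):
    """Return (hit rate %, over-segmentation %, R-value).

    n_hit   -- reference boundaries matched by a detected boundary
    n_ref   -- number of reference boundaries
    n_found -- number of boundaries the algorithm produced
    Assumes the commonly cited R-value definition; not taken from the abstract.
    """
    hr = 100.0 * n_hit / n_ref                # hit rate in percent
    os_ = 100.0 * (n_found / n_ref - 1.0)     # over-segmentation in percent
    # Distance to the ideal operating point (HR = 100, OS = 0).
    r1 = math.sqrt((100.0 - hr) ** 2 + os_ ** 2)
    # Signed distance to the line OS = HR - 100 (no net over-segmentation).
    r2 = (-os_ + hr - 100.0) / math.sqrt(2.0)
    r_value = 1.0 - (abs(r1) + abs(r2)) / 200.0
    return hr, os_, r_value


# Hypothetical counts: a baseline, and the same system padded with extra boundaries.
baseline = segmentation_scores(n_hit=80, n_ref=100, n_found=100)
padded = segmentation_scores(n_hit=90, n_ref=100, n_found=200)
print("baseline (HR, OS, R):", baseline)
print("padded   (HR, OS, R):", padded)
```

In this made-up comparison the padded system has the higher hit rate but the lower R-value, which is the behaviour the abstract asks of a measure that should not reward random boundary insertion.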
Similar resources
A new quality measure for topic segmentation of text and speech
The recent proliferation of large multimedia collections has gathered immense attention from the speech research community, because speech recognition enables the transcription and indexing of such collections. Topicality information can be used to improve transcription quality and enable content navigation. In this paper, we give a novel quality measure for topic segmentation algorithms that i...
A Hybrid 3D Colon Segmentation Method Using Modified Geometric Deformable Models
Introduction: Virtual colonoscopy has become a reliable and efficient method of detecting the primary stages of colon cancer, for example through polyp detection. One of the most important stages of virtual colonoscopy is colon segmentation, because an incorrect segmentation may lead to a misdiagnosis. Materials and Methods: In this work, a hybrid method based on Geometric Deformable Models...
An improved joint model: POS tagging and dependency parsing
Dependency parsing is a form of syntactic parsing that automatically analyzes the dependency structure of sentences, producing a dependency graph for each input sentence. Part-Of-Speech (POS) tagging is a prerequisite for dependency parsing. Generally, dependency parsers do the POS tagging task along with dependency parsing in a pipeline mode. Unfortunately, in pipel...
Comparing the level of stress, depression and quality of life in mothers of children with speech and language disorders with emphasis on the duration of receiving rehabilitation services
Objective: Having a child with speech and language disorders, while creating a sense of great therapeutic responsibility towards them, will significantly affect the psychological dimensions of parents, especially mothers as the primary caregiver. They are more exposed to the pressures and stress that result from the sense of responsibility towards their children. The aim of this study was to co...
Partial Differential Equations applied to Medical Image Segmentation
This paper presents an application of partial differential equations (PDEs) to the segmentation of the abdominal and thoracic aorta in CTA datasets. An important challenge in reliably detecting the aorta is the need to overcome problems associated with intensity inhomogeneities. Level sets are part of an important class of methods that utilize partial differential equations (PDEs) and have been exte...